Exploiting Local Structures with the Kronecker Layer in Convolutional Networks
نویسندگان
چکیده
In this paper, we propose and study a technique to reduce the number of parameters and computation time in convolutional neural networks. We use Kronecker product to exploit the local structures within convolution and fully-connected layers, by replacing the large weight matrices by combinations of multiple Kronecker products of smaller matrices. Just as the Kronecker product is a generalization of the outer product from vectors to matrices, our method is a generalization of the low rank approximation method for convolution neural networks. We also introduce combinations of different shapes of Kronecker product to increase modeling capacity. Experiments on SVHN, scene text recognition and ImageNet dataset demonstrate that we can achieve 3.3× speedup or 3.6× parameter reduction with less than 1% drop in accuracy, showing the effectiveness and efficiency of our method. Moreover, the computation efficiency of Kronecker layer makes using larger feature map possible, which in turn enables us to outperform the previous state-of-the-art on both SVHN(digit recognition) and CASIA-HWDB (handwritten Chinese character recognition) datasets.
منابع مشابه
Convolutional and recurrent neural networks
Convolutional neural networks (CNNs) are biologically-inspired variants of multi-layer perceptrons (MLPs). In biology, a visual cortex contains a complex arrangement of cells. These cells are sensitive to small subregions of the visual field. Inspired by the structure of visual cortices and cells, the notion of receptive fields and local filters are introduced as a core component of convolution...
متن کاملIntroduction to Convolutional Neural Networks
6 The convolution layer 11 6.1 What is convolution? . . . . . . . . . . . . . . . . . . . . . . . . . 11 6.2 Why to convolve? . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 6.3 Convolution as matrix product . . . . . . . . . . . . . . . . . . . 15 6.4 The Kronecker product . . . . . . . . . . . . . . . . . . . . . . . 17 6.5 Backward propagation: update the parameters . . . . . . . . ...
متن کاملDecision Support System for Age-Related Macular Degeneration Using Convolutional Neural Networks
Introduction: Age-related macular degeneration (AMD) is one of the major causes of visual loss among the elderly. It causes degeneration of cells in the macula. Early diagnosis can be helpful in preventing blindness. Drusen are the initial symptoms of AMD. Since drusen have a wide variety, locating them in screening images is difficult and time-consuming. An automated digital fundus photography...
متن کاملA hybrid EEG-based emotion recognition approach using Wavelet Convolutional Neural Networks (WCNN) and support vector machine
Nowadays, deep learning and convolutional neural networks (CNNs) have become widespread tools in many biomedical engineering studies. CNN is an end-to-end tool which makes processing procedure integrated, but in some situations, this processing tool requires to be fused with machine learning methods to be more accurate. In this paper, a hybrid approach based on deep features extracted from Wave...
متن کاملA New Method to Improve Automated Classification of Heart Sound Signals: Filter Bank Learning in Convolutional Neural Networks
Introduction: Recent studies have acknowledged the potential of convolutional neural networks (CNNs) in distinguishing healthy and morbid samples by using heart sound analyses. Unfortunately the performance of CNNs is highly dependent on the filtering procedure which is applied to signal in their convolutional layer. The present study aimed to address this problem by a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1512.09194 شماره
صفحات -
تاریخ انتشار 2015